Using AR HMM state-dependent filtering for speech enhancement

نویسندگان

  • Driss Matrouf
  • Jean-Luc Gauvain
چکیده

In this paper we address the problem of enhancing speech which has been degraded by additive noise. As proposed by Ephraim et al., autoregressive hidden Markov models (AR-HMM) for the clean speech and an autoregressive Gaussian for the noise are used. The filter applied to a given frame of noisy speech is estimated using the noise model and the autoregressive Gaussian having the highest a posteriori probability given the decoded state sequence. The success of this technique is highly dependent on accurate estimation of the best state sequence. A new strategy combining the use of cepstral-based HMMs, autoregressive HMMs, and a model combination technique, is proposed. The intelligibility of the enhanced speech is indirectly assessed via speech recognition, by comparing performance on noisy speech with compensated models to performance on the enhanced speech with clean-speech models. The results on enhanced speech are as good as our best results obtained with noise compensated models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Speech Enhancement by Modified Convex Combination of Fractional Adaptive Filtering

This paper presents new adaptive filtering techniques used in speech enhancement system. Adaptive filtering schemes are subjected to different trade-offs regarding their steady-state misadjustment, speed of convergence, and tracking performance. Fractional Least-Mean-Square (FLMS) is a new adaptive algorithm which has better performance than the conventional LMS algorithm. Normalization of LMS ...

متن کامل

HMM-Based Speech Enhancement Using Pitch Period Information in Voiced Speech Segments

An extension of the HMM-based speech enhancement approach [1] is presented. The HMM-based scheme uses hidden Markov models (HMM) to control a state-dependent Wiener filter, which is used to process the noisy speech signal. This scheme gives enhanced speech signals without the annoying tonal artefacts (‘musical noise’) of the spectral subtraction approach. However, parts of the enhanced signal o...

متن کامل

Codebook Constrained Wiener Filtering for Speech Enhancement - Speech and Audio Processing, IEEE Transactions on

We propose a modified hidden Markov model (MHMM) thatincorporates nonparametric state duration and state duration-dependentobservation probabilities to reflect state transitions and to have accuratetemporal structures in the HMM.In addition, to cope with the problem that results from the use ofinsufficient amount of training data, we propose to use the modifiedcontinuous...

متن کامل

Subband Kalman Filtering for Speech Enhancement

Kalman filtering is an effective speech-enhancement technique, in which speech signals are usually modeled as autoregressive (AR) processes and represented in the state-space domain. Since AR coefficients identification and Kalman filtering require extensive computations, real-time implementation of this approach is difficult. This paper proposes a simple and practical scheme that overcomes the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999